Protecting a Moving Target: Addressing Web Application Concept Drift
نویسندگان
چکیده
Because of the ad hoc nature of web applications, intrusion detection systems that leverage machine learning techniques are particularly well-suited for protecting websites. The reason is that these systems are able to characterize the applications’ normal behavior in an automated fashion. However, anomaly-based detectors for web applications suffer from false positives that are generated whenever the applications being protected change. These false positives need to be analyzed by the security officer who then has to interact with the web application developers to confirm that the reported alerts were indeed erroneous detections. In this paper, we propose a novel technique for the automatic detection of changes in web applications, which allows for the selective retraining of the affected anomaly detection models. We demonstrate that, by correctly identifying legitimate changes in web applications, we can reduce false positives and allow for the automated retraining of the anomaly models. We have evaluated our approach by analyzing a number of real-world applications. Our analysis shows that web applications indeed change substantially over time, and that our technique is able to effectively detect changes and automatically adapt the anomaly detection models to the new structure of the changed web applications.
منابع مشابه
Mining Concept Drift from Data Streams by Unsupervised Learning
Mining is involved with knowing the unknown characteristics from the databases or gaining of Knowledge (Knowledge Discovery) from Databases to get more useful information from the database. Real time databases which are constantly changing with time, there may arise a point when traditional Data Mining techniques may not be adequate as there may be a previously unknown class label involved or n...
متن کاملConcept drift detection in event logs using statistical information of variants
In recent years, business process management (BPM) has been highly regarded as an improvement in the efficiency and effectiveness of organizations. Extracting and analyzing information on business processes is an important part of this structure. But these processes are not sustainable over time and may change for a variety of reasons, such as the environment and human resources. These changes ...
متن کاملReal-time Learning when Concepts Shift
We are interested in real-time learning problems where the underlying stochastic process, which generates the target concept, changes over time. We want our learner to detect when a change has occurred, thus realizing that the learned concept no longer fits the observed data. Our initial approach to this problem has been to analyze offline approaches to addressing concept shifts and to apply th...
متن کاملConcept drift and how to identify it
This paper studies concept drift over time. We first define the meaning of a concept in terms of intension, extension and label. Then we study concept drift over time using two theories: one based on concept identity and one based on concept morphing. A qualitative toolkit for analysing concept drift is proposed to detect concept shift and stability when concept identity is available, and conce...
متن کاملConcept drift learning and its application to adaptive information filtering
Concept Drift Learning and Its Application to Adaptive Information Filtering. (December 2003) Dwi HendratmoWidyantoro, B.S., Institut Teknologi Bandung; M.S., Texas A&M University Co-Chairs of Advisory Committee: Dr. John Yen Dr. Thomas R. Ioerger Tracking the evolution of user interests is a problem instance of concept drift learning. Keeping track of multiple interest categories is a natural ...
متن کامل